REPORT  DOCUMENTATION  PAGE 


Form  Approved 
OMB  No.  0704-0188 


_ I _ 

The  public  reporting  burden  for  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information,  including 
suggestions  for  reducing  the  burden,  to  the  Department  of  Defense,  Executive  Service  Directorate  (0704-0188).  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no 
person  shall  be  subject  to  any  penalty  for  failing  to  comply  with  a  collection  of  information  if  it  does  not  display  a  currently  valid  OMB  control  number. 


PLEASE  DO  NOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ORGANIZATION. 


1.  REPORT  DATE  (DD-MM-YYYY)  2.  REPORT  TYPE 

May  6,  201  Final  Report 

3.  DATES  COVERED  (From  -  To) 
9/8/14-12/7/15 

4.  TITLE  AND  SUBTITLE 

A  New  Mathematical  Framework  for  Design  under  Uncertanty 

5a.  con 

TRACT  NUMBER 

HR001 1-14-1-0060 

Sb.  GRANT  NUMBER 

Sc.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

George  Em  Karniadais 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Brown  University/Division  of  Applied  Mathematics 

182  George  Street 

Providence,  RI 02912 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

DARPA 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

Defense  Advance  Research  Projects  Agency 

675  North  Randolph  Street 

Arlington,  VA  22203-21 14 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

1 2.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  Public  Release,  Distribution  Unlimited 


13.  SUPPLEMENTARY  NOTES 


14.  ABSTRACT 

The  objective  of  this  seedling  project  is  to  develop  a  new  theoretical  framework  for  design  under  uncertainty  (DUU)  based  on  new  stochastic 
approaches  for  risk-averse  design  and  optimization  of  engineering  systems  under  uncertainty. 


15.  SUBJECT  TERMS 

design  under  uncertainty,  machine  learning,  novel  data  fusion 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

18.  NUMBER 
OF 

PAGES 

19a.  NAME  OF  RESPONSIBLE  PERSON 

a.  REPORT 

b.  ABSTRACT 

c.  THIS  PAGE 

George  Em  Kamiadakis 

uu 

UU 

UU 

UU 

19b.  TELEPHONE  NUMBER  (Include  area  code) 
401-863-1217 

Standard  Form  298  (Rev.  8/98) 

Prescribed  by  ANSI  Std.  Z39.18 


Adobe  Professional  7.0 


DARPA  HR001 1-14-1-0060  (Final  Report): 

A  new  mathematical  framework  for  design  under 

uncertainty 
(  9/8/14-12/7/15) 

PI:  George  Karniadakis,  Brown  University 
May  5,  2016 


1  Motivation  and  Objectives 

The  National  Defense  Authorization  Act  of  2012  identified  a  capability  gap  for  future  Special 
Operations  Forces  (SOF)  missions.  The  Act  cites  the  aging  MK  V  Special  Operations  Craft 
and  the  delayed  inception  of  the  Combatant  CraftMedium  as  a  critical  shortfall  in  the  ability  to 
perform  missions  ranging  from  maritime  interdiction  to  infiltration  and  extraction  of  personnel. 
In  order  to  fill  this  mission  capability  gap,  new  unconventional  designs  of  marine  vehicles  are 
required  that  can  adapt  to  big  waves  and  move  at  speeds  twice  as  high  as  the  existing  SOF 
vehicles.  Hence,  the  theoretical  design  of  such  SOF  vehicles  and  more  broadly  of  other  future 
DoD  applications  involving  unconventional  designs  and  resilient  systems  requires  high-fidelity 
analyses,  management  of  large  data  sets  from  a  variety  of  sources,  efficient  allocation  of  com¬ 
putational  resources,  and,  most  importantly,  reliable  quantification  of  uncertainty  inherent  in 
multi- physics  models  of  variable  fidelity,  operating  conditions  (e.g.,  extreme  state  seas),  and 
numerical  computations  as  well  as  utilization  of  such  information  in  risk-averse  decision  making. 
Even  for  classical  engineering  systems  that  can  be  described  at  various  levels  of  fidelity  and  for 
which  experimental  data  may  exist,  currently  there  are  no  mathematically  rigorous  methods  to 
combine  these  disparate  information  sources  into  a  viable  framework  for  the  purpose  of  design 
and  optimization. 

The  objective  of  this  seedling  project  is  to  overcome  these  limitations  by  developing  a 
new  theoretical  framework  for  design  under  uncertainty  (DUU),  based  on  new  stochastic  ap¬ 
proaches  for  risk-averse  design  and  optimization  of  engineering  systems  under  uncertainty.  To 
this  end,  we  combine  stochastic  multi-fidelity  models  and  other  sources  of  information  about 
system  performance  (including  experimental  data  and/or  expert  opinions)  using  novel  data  fu¬ 
sion  and  machine  learning  approaches;  see  Fig.  1.  This  will  lead  to  a  comprehensive  description 
of  system  performance  with  less  uncertainty  than  in  the  case  of  single-source/model  analysis,  it 
will  allow  the  exploration  of  broader  design  spaces,  and  it  will  enable  us  to  endow  final  designs 
with  rigorous  certificates  of  quality.  None  of  the  methods  in  current  engineering  practice  can 
consistently  account  for  the  errors  and  uncertainties  present  in  the  complex  sequence  of  analy¬ 
ses  embedded  in  the  design  process,  and,  furthermore,  key  statistical  relationships  among  those 
analyses  are  usually  ignored.  In  addition,  all  decisions  regarding  the  order  in  which  to  use  avail¬ 
able  analysis  and  optimization  tools  are  usually  based  on  experience  and  intuition,  which  often 
yields  under-performance  and  inefficient  use  of  available  resources.  The  multi-information-source 
optimization  framework  we  develop  allows  us  to  decide  on  the  timing  and  execution  of  a  large 
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Figure  1:  Sketch  of  the  multi- information-source  optimization  approach  for  the  design  of  an  unconven¬ 
tional  high-speed  marine  vehicle.  The  framework  contains  the  formulation  of  the  stochastic  optimization 
problem  and  relies  on  a  number  of  information  sources  (IS)  for  each  participating  discipline,  e.g.,  struc¬ 
tural  mechanics,  hydrodynamics,  or  material  science.  The  multiple  ISs  for  each  discipline  represent  the 
availability  of  tools  at  different  levels  of  fidelity  (larger  circles  indicate  higher  fidelity).  For  each  disci¬ 
pline,  stochastic  variable  fidelity  methods  can  be  used  to  synthesize  statistical  relationships  between  ISs 
with  the  help  of  stochastic  reduced  order  models  (ROMs)  and  error  estimation  techniques  to  enhance 
accuracy  and  manage  computational  cost. 


ensemble  of  predictive  tools  including  stochastic  simulations,  assimilation  of  noise-corrupted 
experimental  data,  and  subject-matter  expert  opinions. 

For  a  specific  demonstration  we  consider  an  unconventional  ultra-high  speed  marine  vehi¬ 
cle  suitable  to  perform  high  speed  manned  and  unmanned  operations  under  severe  sea  state 
conditions.  There  will  be  two  modes  of  operation,  the  first  targets  up  to  moderate  sea  states 
and  very  high  speeds,  without  active  control;  the  second  will  target  heavier  sea  states  where 
reduced  (but  still  high)  speed  will  allow  path  planning  in  rough  seas  to  avoid  individual  extreme 
waves  with  active  control.  Specifically,  we  will  design  and  optimize  a  new  concept  for  a  vessel 
developed  based  on  preliminary  work  on  the  so-called  Hybrid  Hydrofoil  H2-SWATH  at  MIT. 
The  craft  features  multiple  operating  modes,  e.g.,  in  displacement  mode,  the  vessel  sails  as  an 
unconventional  SWATH  at  moderate  speeds  (less  than  20  knots)  while  in  foil-borne  mode  it  can 
fold  into  the  water  the  surface-piercing/super-cavitating  (SP/SC)  hydrofoil,  hence  potentially 
reaching  120  knots  maximum  speed,  i.e.,  doubling  the  speed  of  any  existing  surface  craft.  It  can 
also  sail  at  a  slower  speed,  actively  controlled  foil-borne  mode,  e.g.  around  60  knots  in  heavy 
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seas  with  active,  real-time  path  planning  to  avoid  excessive  waves. 

For  the  seedling  project,  due  to  the  limited  time  and  scope  of  the  project  we  have  demon¬ 
strated  two  key  elements  of  the  proposed  DUU  framework  in  two  related  tasks: 

1.  Multi-fidelity  modeling  via  recursive  Co-Kriging  and  Gaussian-Markov  Random  Fields, 
and 

2.  Bayesian  optimization  of  the  most  crucial  component  of  the  H2-SWATH,  namely,  the 
supercavitating  hydrofoil. 

In  addition,  in  section  4  we  list  a  number  of  publications  resulting  from  the  current  work 
describing  the  physics-based  models  that  we  have  developed  in  the  course  of  this  seedling  project 
to  support  the  computational  simulation  work  required  in  the  design  of  the  H2-SWATH  vessel. 

2  Task  1:  Multi-fidelity  modeling  via  recursive  Co-Kriging 
and  Gaussian-Markov  Random  Fields 

In  this  task  we  have  developed  a  new  framework  for  design  under  uncertainty  based  on  stochas¬ 
tic  computer  simulations  and  multi-level  recursive  co-kriging.  The  proposed  methodology  si¬ 
multaneously  takes  into  account  multi-fidelity  in  models,  such  as  direct  numerical  simulations 
versus  empirical  formulas,  as  well  as  multi-fidelity  in  the  probability  space  (e.g.,  sparse  grids  vs. 
tensor  product  multi-element  probabilistic  collocation).  We  are  able  to  construct  response  sur¬ 
faces  of  complex  dynamical  systems  by  blending  multiple  information  sources  via  auto-regressive 
stochastic  modeling.  A  computationally  efficient  machine  learning  framework  is  developed  based 
on  multi-level  recursive  co-kriging  with  sparse  precision  matrices  of  Gaussian  Markov  random 
fields.  The  effectiveness  of  the  new  algorithms  is  demonstrated  in  numerical  examples  involving 
a  prototype  problem  in  risk-averse  design,  regression  of  random  functions,  as  well  as  uncertainty 
quantification  in  fluid  mechanics  involving  the  evolution  of  a  Burgers  equation  from  a  random 
initial  state,  and  random  laminar  wakes  behind  circular  cylinders. 

We  have  already  published  this  work  in  [3]  so  here  we  only  provide  a  simple  example  to 
demonstrate  the  capability  of  the  method.  Specifically,  we  consider  a  system  with  two  input 
design  variables  x  =  (xi,x2),  subject  to  external  uncertainties,  described  by  four  standard 
normal  random  variables  £  =  (£i,  £2,  £3,  £4)-  Let  the  response  of  this  system,  denoted  by  y(x;£) 
be  described  by  the  random  function 

/e(x;  0  =  6  sin2(5£i;ri  +  2&£3x2)  +  2£2e_Xl(6+«3)G2-o.5fi  cos2(4a;i  +  x2).  (1) 

Now,  assume  that  /e(x;  £)  returns  the  real  high-fidelity  response  but  is  expensive  to  evaluate. 
On  the  other  hand,  let  /c(x;£)  be  a  low-fidelity  cheap  to  evaluate  surrogate  model  that  we  can 
sample  extensively  as 

/c(x;  0  =  1.7 /e(x;  f)  +  2fif2  sin(aq  +  x2)  +  5 ^\e~Xl  sin(aq  +  7x2).  (2) 

Our  goal  here  is  to  employ  the  proposed  multi-fidelity  framework  to  construct  the  response 
surface  of  the  mean  field  S(x)  =  E[T'(x;£)].  In  order  to  approximate  the  expectation  operator, 
we  employ  two  methods  of  different  fidelity  in  probability  space.  To  this  end,  we  choose  our 
high-fidelity  probabilistic  method  to  be  a  Smolyak  sparse  grid  level-5  quadrature  (SG-L5)  that 
discretizes  the  four-dimensional  parameter  space  using  4,994  quadrature  points.  Similarly,  the 
low-fidelity  method  in  probability  space  is  a  coarser,  levcl-2  sparse  grid  quadrature  (SG-L2)  with 
just  57  quadrature  points. 

Therefore,  our  multi-fidelity  setup  consists  of  two  models  in  physical  space  (/e(x;  £),  /c(x;  £)), 
and  two  models  in  probability  space  (SG-L5,  SG-L2).  This  results  in  a  family  of  response  surfaces 
Sij,  that  can  be  organized  as 
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(3) 


(  Su  \  _  (  E5G.L2[/c(x;^)]  E 5G-l5[/c(x;  0]  A 

\  >521  S'22  )  V  E5G-L2[/e(x;0]  ^SG-L5  [fe  (x;  £)]  / 

Fig.  2  demonstrates  the  response  surface  produced  using  a  4-lcvel  recursive  co-kriging  scheme, 
traversing  the  available  models  and  data  in  the  order  ,Sj  i  — >  S 12  —>  S2i  S22,  i.e. ,  from  lowest 
to  highest  fidelity.  The  resulting  response  surface  is  compared  with  an  “exact”  solution  that 
is  obtained  by  Monte  Carlo  integration  of  the  high-fidelity  physical  model  /e(x;£)  using  106 
samples.  Evidently,  the  highly  non-linear  response  of  the  mean  field  is  captured  remarkably  well 
by  just  using  five  observations  of  the  expensive  highest-fidelity  model  S22,  supplemented  by  a 
number  of  inaccurate  low-fidelity  observations  from  (Su,  S\2,  S2i).  This  observation  is  further 
confirmed  by  the  uncertainty  of  the  predictor,  quantified  by  the  co-kriging  variance  (see  Fig.  2, 
inset),  which  is  bounded  by  10-3.  To  underline  the  merits  of  using  the  proposed  multi-fidelity 
approach  note  that  the  relative  h2  error  between  the  exact  solution  and  the  4-level  co-kriging 
predictor  is  10-2.  This  is  in  sharp  contrast  with  the  corresponding  error  from  fitting  a  kriging 
model  through  the  highest  fidelity  observations  S22,  which  here  is  as  high  as  0.5. 

3  Task  2:  Bayesian  optimization  of  super-cavitating  hy¬ 
drofoils 

The  goal  of  this  study  is  to  demonstrate  the  capabilities  of  statistical  learning  and  information 
fusion  in  carving  a  robust  and  tractable  workflow  for  design  optimization  under  uncertainty.  As  a 
prototype  problem  we  have  considered  the  shape  optimization  of  a  2D  super-cavitating  hydrofoil. 
We  have  not  yet  published  this  work  so  here  we  outline  the  basic  methodology  for  reconstructing 
response  surfaces  for  quantities  of  interest  from  the  output  of  multi-fidelity  information  sources. 
The  key  concepts  behind  the  proposed  framework  are  Gaussian  process  regression  and  auto¬ 
regressive  information  fusion  as  developed  in  task  1,  see  above.  This  probabilistic  setting  enables 
the  accurate  reconstruction  of  response  surfaces  from  scattered  variable  fidelity  observations,  and 
allows  us  to  identify  the  functional  relation  between  inputs  and  outputs  with  quantified  error 
bars.  The  latter  is  an  essential  ingredient  in  Bayesian  optimization  where  the  propagation  of 
input  and  model  uncertainties  through  the  system  dynamics  needs  to  be  rigorously  quantified. 

3.1  Formulation 

Our  aim  is  to  accurately  characterize  the  response  surface  of  various  quantities  of  interest  by 
seamlessly  combining  low-fidelity  approximations  that  can  be  obtained  with  very  low  computa¬ 
tional  cost  (for  e.g.  potential  flow  solvers),  with  a  few  accurate  realizations  of  high-fidelity  models 
that  are  much  more  expensive  to  compute  (for  e.g.  RANS  simulations).  This  is  achieved  by  ex¬ 
ploring  spatial  correlations  between  input  variables  (see  Fig.  3),  and  cross-correlations  between 
different  variable  fidelity  models  through  a  stochastic  auto-regressive  representation  and  recur¬ 
sive  co-kriging.  In  what  follows  we  provide  a  brief  overview  of  the  recursive  co-kriging  scheme 
recently  put  forth  by  Le  Gratiet  et.  al.  [2]  -  a  more  efficient  version  of  the  well-known  auto¬ 
regressive  inference  scheme  proposed  by  Kennedy  and  O’Hagan  [1]  in  the  context  of  predicting 
the  output  from  a  complex  computer  code  when  fast  approximations  are  available.  The  reader 
is  referred  to  [1,  2,  3],  and  the  references  therein,  for  a  detailed  exposition  to  the  theoretical  and 
implementational  aspects  of  this  methodology. 
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Co-kriging  variance  x  10-4 


Figure  2:  Exact  response  surface  S  =  E[F(x;£)]  and  co-kriging  predictor  constructed  using 
four  levels  of  fidelity:  80  Sn  points  (/c(x;£),  SG-L2),  40  Su  points  (/c(x;£),  SG-L5),  10  S21 
points  (/e(x;£),  SG-L2),  and  5  S2 2  points  (/e(x;£),  SG-L5).  The  inset  plot  shows  the  point-wise 
variance  of  the  co-kriging  predictor. 
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In  general,  suppose  we  have  s  levels  of  information  sources  producing  outputs  yt(xt),  at 
locations  xt  E  Dt  C  Md,  sorted  by  increasing  order  of  fidelity.  The  main  idea  here  is  to  model 
the  Nt  scattered  observations  yt(xt)  of  a  quantity  of  interest  Yt(x)  as  a  realization  of  a  Gaussian 
random  field  Zt(x).  The  observations  could  be  corrupted  by  modeling  errors  or  measurement 
noise  £(x),  which  is  thereby  assumed  to  be  a  zero-mean  Gaussian  random  field,  i.e  £(x)  ~ 
A/^O,  of/).  Regression  is  performed  by  assigning  a  prior  distribution  on  over  fields  Zt(x),  and 
calibrating  in  view  of  data  yt(xt).  The  Gaussian  prior  of  Zt(x)  is  completely  characterized  by  a 
mean  held  /it(x)  =  E[Z(x)]  and  an  auto-corelation  function  Kt(x,  x']9t),  where  9t  is  a  vector  of 
hyper-parameters,  to  be  learned  from  the  data  yt(xt).  Typically,  the  choice  of  the  prior  reflects 
our  belief  on  the  structure,  regularity,  and  other  intrinsic  properties  of  the  quantity  of  interest 
Yt(x). 

If  we  sort  the  outputs  yt(xt )  by  increasing  order  of  fidelity,  then  the  auto-regressive  scheme 
of  Kennedy  and  O’Hagan  [1]  reads  as 

Zt(x)  =  pt_i(x)Zt_i(x)  +  <5t(x),  t  =  2, ...,  s,  (4) 

where  St(x)  is  a  Gaussian  held  independent  of  {Zt- 1, . . . ,  Z{\  and  distributed  as  St  rv_/ 

Also,  p(x)  is  a  scaling  factor  that  quantifies  the  correlation  between  (Zt(x),  Zt-i(x)j.  The  recent 
contributions  of  Le  Gratiet  et.  al.  [2]  suggest  replacing  the  Gaussian  held  Zt_i(x)  in  Eq.  4  with 
a  Gaussian  held  Zt_i(x)  that  is  conditioned  on  all  known  observations  {yt-i,yt-2,  ■  ■  ■ ,  2/i }  up  to 
level  (t  —  1),  while  assuming  that  the  corresponding  experimental  design  sets  /},:,  i  —  1, . . . ,  t  —  1 
have  a  nested  structure,  i.e.  D\  C  D2  Q  ■  ■  ■  Q  Dt- 1.  This  essentially  allows  to  decouple  the 
s-level  auto-regressive  co-kriging  problem  to  s  independent  kriging  problems  that  can  be  effi¬ 
ciently  computed  and  are  guaranteed  to  return  a  predictive  mean  and  variance  that  is  identical 
to  the  coupled  scheme  of  Kennedy  and  O’Hagan  [1], 

Once  Zt(x )  has  been  trained  on  the  observed  data  {yt,  yt-i,  ■  ■  ■ ,  yi}  through  maximum  like¬ 
lihood  estimation,  the  optimal  set  of  hyper-parameters  {/h,  <3f ,  of  ,  pt-i,  Ot }  is  known  and  can  be 
used  to  evaluate  the  predictions  yt,  as  well  as  to  quantify  the  prediction  variance  tf  at  all  points 
in  x*  (see  [2]  for  a  derivation), 


yt(x*t)  =  pt  +  (x£)  +  rTt{Rt  +  a\l)  ^^(x*)  -  lpt  -  pt-iyt-i(xt)}, 


„2  i  i-2 

/k-lSt-l(Xf  )  +  at 


1  -  rj (Rt  +  a2eJ)  1rt  + 


[1  -rJ{Rt  +  dll)-Ytn 
1  J{Rt  +  alI)~nt  _ 


(5) 

(6) 


where  Rt  =  Kt(xt,  x'q  9t)  is  the  Nt  x  Nt  correlation  matrix  of  Zt{x),  rt  =  Kt(xt,  x*;  9t)  is  a  1  x  Nt 
vector  containing  the  correlation  between  the  prediction  and  the  Nt  training  points,  and  lt  is 
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a  1  x  Nt  vector  of  ones.  Also,  Kt(x.t,  x%  dt)  is  the  auto-correlation  kernel  that  quantifies  spatial 
correlations  at  level  t,  with  corresponding  hyper-parameters  Qf 

3.2  Results 

In  this  study,  the  quantities  of  interest  are  the  drag  to  lift  coefficient  ratio  ^A,  and  the  ratio 

t10 

between  the  cavity  thickness  at  10%  of  the  chord  over  the  minimum  cavity  thickness,  The 

tc 

recursive  co-kriging  scheme  is  employed  using  two  levels  of  fidelity,  with  low-fidelity  observations 
originating  from  sampling  a  potential  flow  solver,  while  high-fidelity  data  is  obtained  though 
RANS  simulations.  Our  goal  is  to  reconstruct  an  accurate  representation  of  the  response  surfaces 
that  provide  the  functional  relation  between  each  quantity  of  interest  and  the  15  input  variables 
defining  the  shape  of  the  hydrofoil  (see  Fig.  3).  To  this  end,  we  have  a  constructed  a  predictive 
inference  scheme  by  training  a  squared  exponential  auto-correlation  kernel  on  a  subset  of  the 
available  observations.  In  all  cases  we  have  considered  a  training  set  consisting  of  300  randomly 
sampled  low  fidelity  observations  (potential  flow  solver),  supplemented  with  60  realizations  of 
the  high  fidelity  code  (RANS).  The  accuracy  of  the  resulting  probabilistic  predictors  is  then 
tested  against  the  full  set  of  898  high  fidelity  observations.  In  both  cases,  the  total  CPU  time 
for  training  and  predictions  was  about  15  minutes  on  a  single  core  laptop. 

The  scatter  plots  of  Fig.  4  provide  a  qualitative  illustration  of  the  predictive  capacity  of  the 
resulting  multi-fidelity  surrogates.  In  particular,  Fig.  4(a)  depicts  the  output  y  of  the  co-kriging 
predictor  for  100  x  ^A  evaluated  at  all  test  locations,  versus  the  output  y  of  the  high-fidelity 
RANS  solver  at  these  898  points.  Similarly,  in  Fig.  4(b)  contains  the  results  corresponding  to 

(  t10  \ 

the  co-kriging  surrogate  for  (1  —  jSsj-  We  observe  that  all  predicted  points  ( black  cirlces)  are 

tightly  clustered  around  the  diagonal  ( dashed  line),  indicating  that  the  surrogates  where  able  to 
accurately  identify  the  input-output  relations  for  both  quantities  of  interest. 


y 


-1  -0.8  -0.6  -0.4  -0.2  0  0.2 

y  y 

(a)  (b) 

Figure  4:  Testing  the  ability  of  mutli-fidelity  co-kriging  surrogates  to  predict  the  desired  quan¬ 
tities  of  interest  against  the  corresponding  output  of  high-fidelity  RANS  simulations.  Scatter 

C  f  A9  \ 

plots  for:  100  x  ^  (left),  and  ^1  -  (right). 
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Figures  5,  6  measure  the  accuracy  of  the  resulting  milt-fidelity  surrogates  in  predicting  the 
density  distribution  generated  from  the  898  test  outputs  of  the  high-fidelity  RANS  simulator. 
In  particular,  Fig.  5(a)  shows  the  densities  corresponding  to  the  “exact”  high-fidelity  RANS 
output,  the  multi-fidelity  co-kriging  predictor,  and  a  kriging  model  trained  using  only  300  low- 
fidelity  observations.  The  multi-fidelity  surrogate  is  able  to  capture  the  density  distribution 
remarkably  well,  even  at  the  more  challenging  region  around  the  tails  (as  depicted  in  log-scale 
in  Fig.  5(b)).  It  is  also  evident  that  using  a  kriging  surrogate  trained  on  solely  low  fidelity 
observations  leads  to  inaccurate  predictions,  thus  highlighting  the  importance  of  employing  a 
multi- fidelity  approach.  Finally,  these  conclusions  also  hold  for  the  density  estimation  results 
corresponding  to  (1  —  ),  as  depicted  in  Fig.  6(a), (b). 


— 898  High  fidelity  samples 
---Co-kriging  (300  low  fidelity  +  60  high  fidelity) 
—  Kriging  (300  low  fidelity  samples) 


— 898  High  fidelity  samples 
-  -  -Co-kriging  (300  low  fidelity  +  60  high  fidelity) 


(a) 


(b) 


Figure  5:  Testing  the  ability  of  multi-fidelity  co-kriging  surrogates  to  predict  the  desired  quan¬ 
tities  of  interest  against  the  corresponding  output  of  high-fidelity  RANS  simulations.  Density 

estimation  for  100  x  — —  in  linear  (left)  and  logarithmic  scales  (right).  Results  corresponding  to 

Cl 

a  kriging  surrogate  trained  using  only  300  low  fidelity  observations  are  included  to  highlight  the 
significant  advantages  of  a  multi-fidelity  approach. 


— 898  High  fidelity  samples 

---Co-kriging  (300  low  fidelity  +  60  high  fidelity) 

—  Kriging  (300  low  fidelity  samples) 


— 898  High  fidelity  samples 

---Co-kriging  (300  low  fidelity  +  60  high  fidelity) 

—  Kriging  (300  low  fidelity  samples) 


(a)  (b) 

Figure  6:  Testing  the  ability  of  multi-fidelity  co-kriging  surrogates  to  predict  the  desired  quan¬ 
tities  of  interest  against  the  corresponding  output  of  high-fidelity  RANS  simulations.  Density 
/  tio  \ 

estimation  for  f  1  —  j  in  linear  (left)  and  logarithmic  scales  (right).  Results  corresponding 

to  a  kriging  surrogate  trained  using  only  300  low  fidelity  observations  are  included  to  highlight 
the  significant  advantages  of  a  multi-fidelity  approach. 
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